Mapping the Structure of the American Blogosphere
نویسندگان
چکیده
This research project presents an effort to use entry texts and hyperlinks in personal weblogs to observe the variance of local culture reflected in American cities. By geocoding the blogosphere, this project indexes the location of personal weblogs. The hyperlink network among blogs in American cities is presented. Finally, maps of two keywords’ distribution among American cities are plotted. Geocoding the Blogosphere We draw from the NITLE census (November 2003) 952, 626 weblogs registered in the U.S., and check their geographical location using various text mining methods: reviewing the bloggers' ICBM meta tags; city locations inferred from local weather information linked from the blogs’ index pages; the blogger profiles at hosted logs; profiles on “Blogchalk,” a major commercial index of weblogs; data from DNS registrations; and from other keywords on the blogs' index pages. The success rate of retrieving geographical information (specified to national level for non-US blogs, and city level for US blogs) is higher (about 60%) for self-hosted blogs than for blogs on hosting services (about 30%) (Lin & Halavais, 2004). A total of 188,533 are identified with city locations in the United States, and they are indexed by their three-digit zip codes. The distribution of US blogs is plotted to a map of the United States (figure 1). Weblogs were located in a total of 890 three-digit zip code units, and 166 of these units contain more than 300 weblogs from the sample. The size of the circle indicates the size of the blogger population. There is a strong correlation (r=.755) between the number of bloggers and populations of 3-digit zip code units (Lin & Halavais, 2005a). City Networks of Blog Links Drawing a subset of 4,241 weblogs from the above sample, this project extracts the outward links of these weblogs. A total of 632 U.S. city/region units represented by first three-digit US zip codes are taken as nodes of the network. In total, 41,212 permanent links from blogs of each of the city units are counted as the weighted arcs in the network. The bigger circles presented in figure 2 indicate larger numbers of in-links from other city units, while line thickness indicates the totality of hyperlinks. Figure 1: Blog distribution in American cities (each circle represents one 3-digit zip code) (Lin, Halavais, & Zhang, 2005, Lin & Halavais, 2005b). This research finds that weblog networks in America are well connected among metropolitan cities on the west and east coasts. Cities with cultural-political prominence, like Boston, San Francisco, New York, Washington DC and Los Angeles, traditionally the seedbeds of intellectual dialogue and national opinion leaders, forge a highly connected cluster in the center of the national networks. Figure 2: Blog link networks among cities
منابع مشابه
Methodologies for Mapping the Political Blogosphere :
This paper explores methodologies for using the IssueCrawler research tool to map the interconnections of individual blogs in sections of the blogosphere. It uses the case of Australian-born Guantanamo detainee David Hicks as a case study, mapping the distributed discussions of this case in that part of the Australian blogosphere which is concerned with debating news and politics. Its findings ...
متن کاملMapping the Blogosphere in America
This short paper constitutes the first phase of a long-term project focused on probing American urban culture by examining the hyperlinks and text of personal weblogs. It discusses methods of extracting geographic location information from weblogs and ways of indexing weblogs to city units. After a brief introduction to the broader research plan, the paper proposes a process to automatically ex...
متن کاملSecond Space: A Generative Model for the Blogosphere
Analysing complex natural phenomena often requires synthesized data that matches observed characteristics. Graph models are widely used in analyzing the Web in general, but are less suitable for modeling the Blogosphere. While blog networks resemble many properties of Web graphs, the dynamic nature of the Blogosphere, its unique structure and the evolution of the link structure due to blog read...
متن کاملSecond Space: Generative Model for the Blogosphere
Analysing complex natural phenomena often requires synthesized data that matches observed characteristics. Graph models are widely used in analyzing the Web in general, but are less suitable for modeling the Blogosphere. While blog networks resemble many properties of Web graphs, the dynamic nature of the Blogosphere, its unique structure and the evolution of the link structure due to blog read...
متن کاملExistence Results of best Proximity Pairs for a Certain Class of Noncyclic Mappings in Nonreflexive Banach Spaces Polynomials
Introduction Let be a nonempty subset of a normed linear space . A self-mapping is said to be nonexpansive provided that for all . In 1965, Browder showed that every nonexpansive self-mapping defined on a nonempty, bounded, closed and convex subset of a uniformly convex Banach space , has a fixed point. In the same year, Kirk generalized this existence result by using a geometric notion of ...
متن کاملAssessment of geostatistical and interpolation methods for mapping forest dieback intensity in Zagros forests
During recent years, oak decline has been widely spread across Brant’s oak (Quercus Brantii Lindl.) stands in the Zagros Mountains, Western Iran, which caused large-area forest dieback in several sites. Mapping the intensity and spatial distribution of forest dieback is essential for developing management and control strategies. This study evaluated a range of geostatistical and interpolation m...
متن کامل